Accurate discovery of expression quantitative trait loci under confounding from spurious and genuine regulatory hotspots.
Abstract
In genomewide mapping of expression quantitative trait loci (eQTL), it is widely believed that thousands of genes are trans-regulated by a small number of genomic regions called "regulatory hotspots," resulting in "trans-regulatory bands" in an eQTL map. As several recent studies have demonstrated, technical confounding factors such as batch effects can complicate eQTL analysis by causing many spurious associations including spurious regulatory hotspots. Yet little is understood about how these technical confounding factors affect eQTL analyses and how to correct for these factors. Our analysis of data sets with biological replicates suggests that it is this intersample correlation structure inherent in expression data that leads to spurious associations between genetic loci and a large number of transcripts inducing spurious regulatory hotspots. We propose a statistical method that corrects for the spurious associations caused by complex intersample correlation of expression measurements in eQTL mapping. Applying our intersample correlation emended (ICE) eQTL mapping method to mouse, yeast, and human identifies many more cis associations while eliminating most of the spurious trans associations. The concordances of cis and trans associations have consistently increased between different replicates, tissues, and populations, demonstrating the higher accuracy of our method to identify real genetic effects.
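The general idea of correcting an association test for intersample correlation can be sketched as follows. This is a minimal illustration on simulated data, not the authors' ICE implementation: the helper names (`intersample_cov`, `t_stat`), the simulated batch design, and the fixed mixing weight `h` are all assumptions made for the example (a full mixed model would estimate the variance components rather than fix them). The sketch estimates a sample-by-sample covariance from the expression matrix, then compares an ordinary least-squares test with a generalized least-squares test that whitens by that covariance.

```python
import numpy as np

def intersample_cov(expr):
    """Sample-by-sample covariance estimated across genes (expr: genes x samples)."""
    centered = expr - expr.mean(axis=1, keepdims=True)
    k = centered.T @ centered / expr.shape[0]
    return k * (k.shape[0] / np.trace(k))  # scale so the mean diagonal is 1

def t_stat(genotype, y, cov=None):
    """t statistic for the genotype slope: GLS if cov is given, otherwise OLS."""
    n = len(y)
    design = np.column_stack([np.ones(n), genotype])
    if cov is not None:
        # Whiten both sides with the inverse Cholesky factor of cov.
        chol = np.linalg.cholesky(cov)
        design = np.linalg.solve(chol, design)
        y = np.linalg.solve(chol, y)
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    resid = y - design @ beta
    sigma2 = resid @ resid / (n - 2)
    se = np.sqrt(sigma2 * np.linalg.inv(design.T @ design)[1, 1])
    return beta[1] / se

# Simulated data (hypothetical): a batch effect shared by many transcripts.
rng = np.random.default_rng(0)
n_samples, n_genes = 100, 500
batch = np.repeat([0.0, 1.0], n_samples // 2)
effects = rng.normal(0.0, 2.0, size=n_genes)
expr = np.outer(effects, batch) + rng.normal(size=(n_genes, n_samples))

# A genotype confounded with batch, and a transcript driven by batch only.
genotype = batch.copy()
target = 3.0 * batch + rng.normal(size=n_samples)

h = 0.5  # assumed fixed weight on the intersample covariance
cov = h * intersample_cov(expr) + (1 - h) * np.eye(n_samples)

t_naive = t_stat(genotype, target)
t_corrected = t_stat(genotype, target, cov)
print(f"naive t = {t_naive:.2f}, corrected t = {t_corrected:.2f}")
```

In this toy setting the uncorrected test reports a strong association that is entirely due to batch, while the whitened test down-weights the direction of sample space that the shared batch effect dominates.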
Related articles
A Robust Statistical Method for Association-Based eQTL Analysis
BACKGROUND It has been well established that theoretical kernel for recently surging genome-wide association study (GWAS) is statistical inference of linkage disequilibrium (LD) between a tested genetic marker and a putative locus affecting a disease trait. However, LD analysis is vulnerable to several confounding factors of which population stratification is the most prominent. Whilst many met...
Regulatory hotspots are associated with plant gene expression under varying soil phosphorus supply in Brassica rapa.
Gene expression is a quantitative trait that can be mapped genetically in structured populations to identify expression quantitative trait loci (eQTL). Genes and regulatory networks underlying complex traits can subsequently be inferred. Using a recently released genome sequence, we have defined cis- and trans-eQTL and their environmental response to low phosphorus (P) availability within a com...
Identification of Key Causal Regulators in Gene Networks
One primary goal of gene network analysis is to identify key regulatory components, or key drivers, of sub-networks with respect to various biological contexts. Here we developed a general algorithm to identify key drivers in gene regulatory networks. The generalized key driver analysis (KDA) uncovers not only the well-known regulators for the expression quantitative trait locus (eQTL) hotspots...
Regulatory Architecture of Gene Expression Variation in the Threespine Stickleback Gasterosteus aculeatus
Much adaptive evolutionary change is underlain by mutational variation in regions of the genome that regulate gene expression rather than in the coding regions of the genes themselves. An understanding of the role of gene expression variation in facilitating local adaptation will be aided by an understanding of underlying regulatory networks. Here, we characterize the genetic architecture of ge...
Expression quantitative trait loci: replication, tissue- and sex-specificity in mice.
By treating the transcript abundance as a quantitative trait, gene expression can be mapped to local or distant genomic regions relative to the gene encoding the transcript. Local expression quantitative trait loci (eQTL) generally act in cis (that is, control the expression of only the contiguous structural gene), whereas distal eQTL act in trans. Distal eQTL are more difficult to identify wit...
Journal title:
- Genetics
Volume 180, Issue 4
Pages -
Publication date 2008